Computation and Language

Authors and titles for recent submissions

  • Tue, 3 Jun 2025
  • Mon, 2 Jun 2025
  • Fri, 30 May 2025
  • Thu, 29 May 2025
  • Wed, 28 May 2025

Total of 985 entries

Tue, 3 Jun 2025 (showing first 50 of 308 entries)

[1] arXiv:2506.01954 [pdf, html, other]
Title: DRAG: Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillation
Jennifer Chen, Aidar Myrzakhan, Yaxin Luo, Hassaan Muhammad Khan, Sondos Mahmoud Bsharat, Zhiqiang Shen
Comments: ACL 2025 Main. Code is available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[2] arXiv:2506.01952 [pdf, other]
Title: WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks
Atsuyuki Miyai, Zaiying Zhao, Kazuki Egashira, Atsuki Sato, Tatsumi Sunada, Shota Onohara, Hiromasa Yamanishi, Mashiro Toyooka, Kunato Nishina, Ryoma Maeda, Kiyoharu Aizawa, Toshihiko Yamasaki
Comments: Project Page: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3] arXiv:2506.01951 [pdf, html, other]
Title: Self-ensemble: Mitigating Confidence Distortion for Large Language Models
Zicheng Xu, Guanchu Wang, Guangyao Zheng, Yu-Neng Chuang, Alexander Szalay, Xia Hu, Vladimir Braverman
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[4] arXiv:2506.01939 [pdf, other]
Title: Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Shenzhi Wang, Le Yu, Chang Gao, Chujie Zheng, Shixuan Liu, Rui Lu, Kai Dang, Xionghui Chen, Jianxin Yang, Zhenru Zhang, Yuqiong Liu, An Yang, Andrew Zhao, Yang Yue, Shiji Song, Bowen Yu, Gao Huang, Junyang Lin
Comments: 25 pages, 17 figures, 2 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[5] arXiv:2506.01938 [pdf, html, other]
Title: Novel Benchmark for NER in the Wastewater and Stormwater Domain
Franco Alberto Cardillo, Franca Debole, Francesca Frontini, Mitra Aelami, Nanée Chahinian, Serge Conrad
Subjects: Computation and Language (cs.CL)
[6] arXiv:2506.01937 [pdf, html, other]
Title: RewardBench 2: Advancing Reward Model Evaluation
Saumya Malik, Valentina Pyatkin, Sander Land, Jacob Morrison, Noah A. Smith, Hannaneh Hajishirzi, Nathan Lambert
Comments: Data, models, and leaderboard available at this https URL
Subjects: Computation and Language (cs.CL)
[7] arXiv:2506.01928 [pdf, html, other]
Title: Esoteric Language Models
Subham Sekhar Sahoo, Zhihan Yang, Yash Akhauri, Johnna Liu, Deepansha Singh, Zhoujun Cheng, Zhengzhong Liu, Eric Xing, John Thickstun, Arash Vahdat
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[8] arXiv:2506.01920 [pdf, html, other]
Title: From Guidelines to Practice: A New Paradigm for Arabic Language Model Evaluation
Serry Sibaee, Omer Nacar, Adel Ammar, Yasser Al-Habashi, Abdulrahman Al-Batati, Wadii Boulila
Subjects: Computation and Language (cs.CL)
[9] arXiv:2506.01918 [pdf, html, other]
Title: Spatial Coordinates as a Cell Language: A Multi-Sentence Framework for Imaging Mass Cytometry Analysis
Chi-Jane Chen, Yuhang Chen, Sukwon Yun, Natalie Stanley, Tianlong Chen
Subjects: Computation and Language (cs.CL)
[10] arXiv:2506.01872 [pdf, html, other]
Title: Is Extending Modality The Right Path Towards Omni-Modality?
Tinghui Zhu, Kai Zhang, Muhao Chen, Yu Su
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[11] arXiv:2506.01859 [pdf, other]
Title: CONFETTI: Conversational Function-Calling Evaluation Through Turn-Level Interactions
Tamer Alkhouli, Katerina Margatina, James Gung, Raphael Shu, Claudia Zaghi, Monica Sunkara, Yi Zhang
Comments: ACL 2025 (main conference)
Subjects: Computation and Language (cs.CL)
[12] arXiv:2506.01846 [pdf, other]
Title: Code-Switching and Syntax: A Large-Scale Experiment
Igor Sterner, Simone Teufel
Comments: Findings of ACL 2025
Subjects: Computation and Language (cs.CL)
[13] arXiv:2506.01840 [pdf, other]
Title: Minimal Pair-Based Evaluation of Code-Switching
Igor Sterner, Simone Teufel
Comments: ACL 2025
Subjects: Computation and Language (cs.CL)
[14] arXiv:2506.01829 [pdf, html, other]
Title: CiteEval: Principle-Driven Citation Evaluation for Source Attribution
Yumo Xu, Peng Qi, Jifan Chen, Kunlun Liu, Rujun Han, Lan Liu, Bonan Min, Vittorio Castelli, Arshit Gupta, Zhiguo Wang
Comments: ACL 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[15] arXiv:2506.01819 [pdf, html, other]
Title: Not All Jokes Land: Evaluating Large Language Models Understanding of Workplace Humor
Moahmmadamin Shafiei, Hamidreza Saffari
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[16] arXiv:2506.01817 [pdf, html, other]
Title: BD at BEA 2025 Shared Task: MPNet Ensembles for Pedagogical Mistake Identification and Localization in AI Tutor Responses
Shadman Rohan, Ishita Sur Apan, Muhtasim Ibteda Shochcho, Md Fahim, Mohammad Ashfaq Ur Rahman, AKM Mahbubur Rahman, Amin Ahsan Ali
Subjects: Computation and Language (cs.CL)
[17] arXiv:2506.01814 [pdf, html, other]
Title: Analysis of LLM Bias (Chinese Propaganda & Anti-US Sentiment) in DeepSeek-R1 vs. ChatGPT o3-mini-high
PeiHsuan Huang, ZihWei Lin, Simon Imbot, WenCheng Fu, Ethan Tu
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[18] arXiv:2506.01808 [pdf, html, other]
Title: NAVER LABS Europe Submission to the Instruction-following Track
Beomseok Lee, Marcely Zanon Boito, Laurent Besacier, Ioan Calapodescu
Subjects: Computation and Language (cs.CL)
[19] arXiv:2506.01807 [pdf, html, other]
Title: Propaganda and Information Dissemination in the Russo-Ukrainian War: Natural Language Processing of Russian and Western Twitter Narratives
Zaur Gouliev
Comments: 7 pages; 6 figures
Subjects: Computation and Language (cs.CL)
[20] arXiv:2506.01796 [pdf, html, other]
Title: Read it in Two Steps: Translating Extremely Low-Resource Languages with Code-Augmented Grammar Books
Chen Zhang, Jiuheng Lin, Xiao Liu, Zekai Zhang, Yansong Feng
Comments: ACL 2025
Subjects: Computation and Language (cs.CL)
[21] arXiv:2506.01793 [pdf, html, other]
Title: Human-Centric Evaluation for Foundation Models
Yijin Guo, Kaiyuan Ji, Xiaorong Zhu, Junying Wang, Farong Wen, Chunyi Li, Zicheng Zhang, Guangtao Zhai
Subjects: Computation and Language (cs.CL)
[22] arXiv:2506.01784 [pdf, html, other]
Title: iQUEST: An Iterative Question-Guided Framework for Knowledge Base Question Answering
Shuai Wang, Yinan Yu
Comments: Accepted to ACL 2025 (Main)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[23] arXiv:2506.01776 [pdf, other]
Title: MaXIFE: Multilingual and Cross-lingual Instruction Following Evaluation
Yile Liu, Ziwei Ma, Xiu Jiang, Jinglu Hu, Jing Chang, Liang Li
Comments: ACL 2025 Main Conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[24] arXiv:2506.01775 [pdf, html, other]
Title: Developing a Mixed-Methods Pipeline for Community-Oriented Digitization of Kwak'wala Legacy Texts
Milind Agarwal, Daisy Rosenblum, Antonios Anastasopoulos
Comments: Accepted to Comput-EL 2025 Workshop. Preprint
Subjects: Computation and Language (cs.CL)
[25] arXiv:2506.01748 [pdf, html, other]
Title: Thinking in Character: Advancing Role-Playing Agents with Role-Aware Reasoning
Yihong Tang, Kehai Chen, Muyun Yang, Zhengyu Niu, Jing Li, Tiejun Zhao, Min Zhang
Subjects: Computation and Language (cs.CL)
[26] arXiv:2506.01734 [pdf, html, other]
Title: Benford's Curse: Tracing Digit Bias to Numerical Hallucination in LLMs
Jiandong Shao, Yao Lu, Jianfei Yang
Comments: Under Review
Subjects: Computation and Language (cs.CL)
[27] arXiv:2506.01732 [pdf, html, other]
Title: Common Corpus: The Largest Collection of Ethical Data for LLM Pre-Training
Pierre-Carl Langlais, Carlos Rosas Hinostroza, Mattia Nee, Catherine Arnett, Pavel Chizhov, Eliot Krzystof Jones, Irène Girard, David Mach, Anastasia Stasenko, Ivan P. Yamshchikov
Subjects: Computation and Language (cs.CL)
[28] arXiv:2506.01723 [pdf, html, other]
Title: Tug-of-war between idiom's figurative and literal meanings in LLMs
Soyoung Oh, Xinting Huang, Mathis Pink, Michael Hahn, Vera Demberg
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[29] arXiv:2506.01713 [pdf, html, other]
Title: SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning
Zhongwei Wan, Zhihao Dou, Che Liu, Yu Zhang, Dongfei Cui, Qinjian Zhao, Hui Shen, Jing Xiong, Yi Xin, Yifan Jiang, Yangfan He, Mi Zhang, Shen Yan
Comments: Under review
Subjects: Computation and Language (cs.CL)
[30] arXiv:2506.01710 [pdf, html, other]
Title: Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning
Fangyu Lei, Jinxiang Meng, Yiming Huang, Tinghong Chen, Yun Zhang, Shizhu He, Jun Zhao, Kang Liu
Comments: Work in progress
Subjects: Computation and Language (cs.CL)
[31] arXiv:2506.01709 [pdf, html, other]
Title: Fairness Dynamics During Training
Krishna Patel, Nivedha Sivakumar, Barry-John Theobald, Luca Zappella, Nicholas Apostoloff
Subjects: Computation and Language (cs.CL)
[32] arXiv:2506.01702 [pdf, html, other]
Title: mdok of KInIT: Robustly Fine-tuned LLM for Binary and Multiclass AI-Generated Text Detection
Dominik Macko
Subjects: Computation and Language (cs.CL)
[33] arXiv:2506.01698 [pdf, html, other]
Title: When LLMs Team Up: The Emergence of Collaborative Affective Computing
Wenna Lai, Haoran Xie, Guandong Xu, Qing Li, S. Joe Qin
Comments: 20 pages, 7 figures, and 3 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[34] arXiv:2506.01687 [pdf, html, other]
Title: StochasTok: Improving Fine-Grained Subword Understanding in LLMs
Anya Sims, Thom Foster, Klara Kaleb, Tuan-Duy H. Nguyen, Joseph Lee, Jakob N. Foerster, Yee Whye Teh, Cong Lu
Subjects: Computation and Language (cs.CL)
[35] arXiv:2506.01675 [pdf, html, other]
Title: Cross-Lingual Transfer of Cultural Knowledge: An Asymmetric Phenomenon
Chen Zhang, Zhiyuan Liao, Yansong Feng
Comments: ACL 2025
Subjects: Computation and Language (cs.CL)
[36] arXiv:2506.01646 [pdf, html, other]
Title: ESGenius: Benchmarking LLMs on Environmental, Social, and Governance (ESG) and Sustainability Knowledge
Chaoyue He, Xin Zhou, Yi Wu, Xinjia Yu, Yan Zhang, Lei Zhang, Di Wang, Shengfei Lyu, Hong Xu, Xiaoqiao Wang, Wei Liu, Chunyan Miao
Comments: 37 pages, 8 figures, 11 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[37] arXiv:2506.01629 [pdf, html, other]
Title: Cross-Lingual Generalization and Compression: From Language-Specific to Shared Neurons
Frederick Riemenschneider, Anette Frank
Comments: Paper accepted for publication at ACL 2025 Main; 10 pages, 20 figures, 4 tables
Subjects: Computation and Language (cs.CL)
[38] arXiv:2506.01627 [pdf, html, other]
Title: MVAN: Multi-View Attention Networks for Fake News Detection on Social Media
Shiwen Ni, Jiawen Li, Hung-Yu Kao
Subjects: Computation and Language (cs.CL)
[39] arXiv:2506.01621 [pdf, html, other]
Title: Domain Lexical Knowledge-based Word Embedding Learning for Text Classification under Small Data
Zixiao Zhu, Kezhi Mao
Comments: 13 pages, 2 figures
Subjects: Computation and Language (cs.CL)
[40] arXiv:2506.01615 [pdf, other]
Title: IndicRAGSuite: Large-Scale Datasets and a Benchmark for Indian Language RAG Systems
Pasunuti Prasanjith, Prathmesh B More, Anoop Kunchukuttan, Raj Dabre
Comments: WIP
Subjects: Computation and Language (cs.CL)
[41] arXiv:2506.01602 [pdf, html, other]
Title: MMD-Sense-Analysis: Word Sense Detection Leveraging Maximum Mean Discrepancy
Kensuke Mitsuzawa
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[42] arXiv:2506.01592 [pdf, html, other]
Title: Statement-Tuning Enables Efficient Cross-lingual Generalization in Encoder-only Models
Ahmed Elshabrawy, Thanh-Nhi Nguyen, Yeeun Kang, Lihan Feng, Annant Jain, Faadil Abdullah Shaikh, Jonibek Mansurov, Mohamed Fazli Mohamed Imam, Jesus-German Ortiz-Barajas, Rendi Chevi, Alham Fikri Aji
Comments: Accepted to ACL 2025 (Findings)
Subjects: Computation and Language (cs.CL)
[43] arXiv:2506.01587 [pdf, html, other]
Title: Unified Large Language Models for Misinformation Detection in Low-Resource Linguistic Settings
Muhammad Islam, Javed Ali Khan, Mohammed Abaker, Ali Daud, Azeem Irshad
Subjects: Computation and Language (cs.CL)
[44] arXiv:2506.01578 [pdf, other]
Title: Prompt Engineering Large Language Models' Forecasting Capabilities
Philipp Schoenegger, Cameron R. Jones, Philip E. Tetlock, Barbara Mellers
Subjects: Computation and Language (cs.CL)
[45] arXiv:2506.01565 [pdf, html, other]
Title: Hanfu-Bench: A Multimodal Benchmark on Cross-Temporal Cultural Understanding and Transcreation
Li Zhou, Lutong Yu, Dongchu Xie, Shaohuan Cheng, Wenyan Li, Haizhou Li
Comments: cultural analysis, cultural visual understanding, cultural image transcreation
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[46] arXiv:2506.01535 [pdf, html, other]
Title: Dictionaries to the Rescue: Cross-Lingual Vocabulary Transfer for Low-Resource Languages Using Bilingual Dictionaries
Haruki Sakajo, Yusuke Ide, Justin Vasselli, Yusuke Sakai, Yingtao Tian, Hidetaka Kamigaito, Taro Watanabe
Comments: Accepted to ACL 2025 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[47] arXiv:2506.01531 [pdf, other]
Title: STORM-BORN: A Challenging Mathematical Derivations Dataset Curated via a Human-in-the-Loop Multi-Agent Framework
Wenhao Liu, Zhenyi Lu, Xinyu Hu, Jierui Zhang, Dailin Li, Jiacheng Cen, Huilin Cao, Haiteng Wang, Yuhan Li, Kun Xie, Dandan Li, Pei Zhang, Chengbo Zhang, Yuxiang Ren, Xiaohong Huang, Yan Ma
Comments: accepted by ACL2025
Subjects: Computation and Language (cs.CL)
[48] arXiv:2506.01524 [pdf, html, other]
Title: V-VAE: A Variational Auto Encoding Framework Towards Fine-Grained Control over Human-Like Chat
Qi Lin, Weikai Xu, Lisi Chen, Bin Dai
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[49] arXiv:2506.01520 [pdf, html, other]
Title: FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents
Bobo Li, Yuheng Wang, Hao Fei, Juncheng Li, Wei Ji, Mong-Li Lee, Wynne Hsu
Comments: 8 pages, 7 figures
Subjects: Computation and Language (cs.CL)
[50] arXiv:2506.01512 [pdf, html, other]
Title: Representations of Fact, Fiction and Forecast in Large Language Models: Epistemics and Attitudes
Meng Li, Michael Vrazitulis, David Schlangen
Comments: accepted by ACL 2025 (main)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)